Differentiating the entropy of random walks on hyperbolic 

groups 



P. Mathieu * 



Abstract 

We show that the asymptotic entropy of a random walk on a non-elementary hyper- 
bolic group, with symmetric and bounded increments, is differentiable and we identify its 
derivative as a correlation. We also prove similar results for the rate of escape. 



1 Introduction 

Let T be an infinite, countable, discrete group with neutral element id, and let /i be a probability 
measure on T. The entropy of /i is defined as 

#0) : = ^(-log/^OMz) ■ 

Note that H(fi) is non-negative and may be infinite. 
Let fi n denote the n-th convolution power of \i. 

We assume that ti{p) < oo. It is easy to check that the sequence (fz"(/i n )) ne N is subadditive 
so that the following limit does exist: 

h(/i) := lim -H{fi n ) . (1.1) 

n— s>oo n 

The quantity h(fi) is called the asymptotic entropy of fi. 

The notion of asymptotic entropy was introduced by A. Avez in [2] in relation with random 
walk theory. In [3], Avez proved that, whenever /i(/i) = 0, then fi satisfies the Liouville property: 
bounded, /i-harmonic functions are constant. The converse was proved later, see [7] and [15]. 

Consider the random walk on T whose increments are distributed according to fi. The 
Liouville property is equivalent to the triviality of the asymptotic cx-field of the random walk 
(its so-called Poisson boundary), see [7] and [15] again. In more general terms, the entropy 
plays a central role in the identification of the Poisson boundary of random walks in many 
examples. We refer in particular to [14] for groups with hyperbolic features. In this latter 
case, the asymptotic entropy is also related to the geometry of the harmonic measure through 
a 'dimension-rate of escape- entropy' formula, see [6] and the references quoted therein. 
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The question of the regularity of h as a function of [i was raised by A. Erschler and V. 
Kaimanovich in [9], where it is proved that, still for hyperbolic groups, the asymptotic entropy 
is continuous. If we restrict ourselves to measures \i with fixed finite support, F. Ledrappier 
recently proved in [17] that h is Lipschitz continuous. We shall complement the result of [17] 
showing that, for a hyperbolic group T and restricting ourselves to symmetric measures with 
a fixed finite support, the asymptotic entropy is differentiable (Theorem 2.2). 

There is an analogy between the definition of h and the definition of the rate of escape 
of the random walk in some left-invariant metric. More precisely, it can be proved that h(fi) 
coincides with the rate of escape of the corresponding random walk in the so-called Green 
metric, see the definition in part 2. We also give sufficient conditions on a metric ensuring that 
the rate of escape is differentiable (Theorem 2.1). 

Our approach completely differs from the one in [17]. We start with the simple observation 
that the derivative of the mean position of a random walk is a correlation, see part 2.3. Thus 
the natural candidate to be the derivative of the rate of escape is some asymptotic covariance. 
These heuristics suggest a close connection between the differentiability of the rate of escape 
and the Central Limit Theorem and explain the statement of Theorem 2.1. 

As for the entropy, one deduces the differentiability of h and the value of its derivative from 
Theorem 2.1 when choosing the right metric (namely the Green metric) and observing that 
fluctuations of the Green metric are of lower order - a fact that follows from the 'fundamental 
inequality' between entropy and rate of escape and which is true for random walks on non- 
amenable groups in general, see part 4. 

The version of the Central Limit Theorem we need is a straightforward extension of [4]. We 
also rely a lot on the 'Green metric machinery' developed in [6]. 

Let us emphasize that we do not claim that our results are optimal. It is quite possible that 
the entropy and rate of escape are much more regular that differentiable. It is actually known 
that the entropy and rate of escape are analytic on the free group [16] and that the rate of 
escape is analytic for some Fuchsian groups [12]. One may hope analityticity holds for general 
hyperbolic groups (although it does not hold for all groups, see [18]). Anyway, we believe the 
interpretation of the derivative as a correlation is rather satisfactory, at least from an intuitive 
point of view. It clarifies the connection between the regularity of the rate of escape and the 
Central Limit Theorem, an observation that seems to be new in our context. 

Let us finally mention that the interpretation of the derivative of a steady state (whatever it 
may mean) as some kind of correlation is a well known idea in theoretical physics or dynamical 
systems, where it is sometimes called 'linear response theory' or 'fluctuation-dissipation theory'. 
See [19] and other papers of the same author. 

2 Definitions and results 
2.1 Definitions 

Let r be an infinite, countable, discrete group with neutral element id. Let d be a left-invariant 
proper metric on T. We assume that T is finitely generated and that d is equivalent to a word 
metric. 

When the context is clear, we may also use the notation \x — y\ to denote the distance 
between x and y and \x\ — \x — id\. 
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We define the Gromov product of points x, y G T with respect to the base point weTby 

(x,y) w := ^(\x-w\ + \y-w\- \x - y\) . 

We recall that the distance d is called hyperbolic if there exists a constant r such that 

(x, y) w > min{(a;, z) w , (z, y) w } - r , (2.2) 

for all x, y,z,w G T. 

The group T is called hyperbolic if any (equivalently some) word metric is hyperbolic. 
A hyperbolic group is called non-elementary if it is non amenable (which turns out to be 
equivalent to requiring T is not a finite extension of Z). 

We refer to [10] for background material on hyperbolic groups. 

From now on, we will assume that T is hyperbolic and non-elementary. 

Following [6], we let T>(T) be the set of left-invariant proper metrics on T which are both 
equivalent to a word metric and hyperbolic. Note that these last two conditions are not redun- 
dant. Indeed there always exist non-hyperbolic (non-geodesic) metrics equivalent to any word 
metric on T, see [6], Proposition 2.3. There may also exist hyperbolic metrics on T that are 
not equivalent to a word metric. 

We shall consider the following two compactifications of T. The visual (Gromov) com- 
pactification is obtained by considering all infinite sequences in T and identifying two such 
sequences, say (x n ) and (y n ), if the Gromov product (x n ,y n ) w tends to infinity. The horo- 
function (Busemann) compactification is constructed by identifying a point x G V with 
the horo- function h x : T — > R, h x (y) = \y — x\ — \ w — x\ and taking the closure for the topology 
of pointwise convergence. By Ascoli's theorem, this is indeed a compact. The group T acts by 
homeomorphisms on both compactifications. Up to equivariant homeomorphisms, the Gromov 
compactification does not depend on the choice of either the base point w or the choice of 
d G T>(Y). The Busemann compactification is also independent of the choice of the base point 
but not of the choice of the metric. We shall say that d satisfies Assumption (BA) if, up to 
equivariant homeomorphisms, the Gromov and Busemann compactifications coincide. 

Assumption (BA) is in particular satisfied by the class of metrics called Green metrics. 
These are constructed as follows. We call a probability measure /ionT 'symmetric' if ^(x' 1 ) = 
n{x) for all x G V. The support of ji is the set of x G V such that /z(x) is not zero. 

Let /i be a probability measure on V . We assume that fi is symmetric and that the support 
of \i is finite and generates the whole group T. The Green function associated to /i is defined 
by 

oo 

?1=0 

where \i n is the n-th convolution power of /i. 

We assumed that T is non-amenable. Therefore the sequence fj, n (x) exponentially converges 
to zero so that the series defining G M does converge. The Green distance between points x and 
y in T is then 

d&x,y) := logG"(id) - logG^x-'y) . (2.3) 
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In [6], we proved that <1q belongs to T>(T). Observe that cIq need not be geodesic. (As a 
matter of fact, it is not so difficult to deduce that d,Q is equivalent to a word metric from the 
non-amenability of V. That d G is hyperbolic is equivalent to a certain multiplicativity property 
of the Green function which is the content of Ancona's classical - and difficult - theorem, 
see [1] and the proof in [21]). 

We now give the definition of the random walk associated to a probability measure fi 
on T. Because we will eventually use Radon-Nikodym transforms, it will be more convenient 
to work with the canonical construction on the set of trajectories on T, say Q = T N , where 
N = {0, 1, ..}. Given u = (uq, uoi, ...) G f2 and n G N, we define the maps Z n and X n from Q 
to T by Z n (bS) := u n , and X n (u) := (Z n _i(u))~ l Z n (u) . Thus Z n (u) gives the position of the 
trajectory u> at time n, while X n (u>) gives its increment also at time n. Following the usual 
usage in probability theory we often omit to indicate that random functions, as Z n or X n , 
depend on uj. 

We equip Q with the product a -field (i.e. the smallest a-field for which all functions Z n are 
measurable). The law of the random walk with increments distributed like fi is, by definition, 
the unique probability measure on Q under which Z = id and the random variables (X n )„ GN 
are independent and distributed like //. We denote it with P M . We also use the notation W to 
denote the expectation with respect to P M . Observe that the law of Z n under P M is /i™. 

We recall that, given a probability measure \i with finite support and given a left-invariant 
metric d G T>(T), the rate of escape of \i in the metric d is defined by 

£(fi;d) := lim — \^d(id,x)fi n (x) 

n-toc n — ' 

and the asymptotic entropy of ji is defined by 

h(fi) := lim -V(-log^(x))^(x). 

n— >oo TL 

Kingman's sub-additive theorem implies that 

£(fj,;d) = lim —d(id,Z n ) and h(jj) = lim \ogfi n (Z n ) 

n-^oo n n->oo n 

where both limits hold P M almost surely as well as in P M ). 
2.2 Differentiability of £ and h 

In the sequel we shall study the derivatives of the rate of escape and the entropy of probability 
measures with a fixed finite support. A subset of T, say S, is 'symmetric' if x _1 G S whenever 
x G S. Let S be a finite symmetric subset of T. We assume that S generates the whole group T. 
Let V S (S) be the set of symmetric probability measures with support equal to S. Then V S (S) 
naturally identifies with an open subset of M. d for some d. We use the differentiable structure 
it inherits this way. 

Regularity Assumption: throughout the paper, we shall assume that the function A G 
[—1,1] — > log H\(a) has a derivative at A = for all a G S. Equivalently, we may write a first 
order expansion of log (J,\(a) as A tends to in the form 

log fi x (a) = log^o(a) + \v(a) + Ao A (a) , (2.4) 
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where u(a) is the derivative of the function A — > log fi\(a) at A = and o\(a) converges to 0. 

Observe that since S is finite, this is equivalent to requiring 0\(a) to converge to uniformly 
with respect to a G S. We shall also repeatedly use the fact that v is bounded. 

We shall use the shorthand notation P A (resp. E A ) instead of P MA (resp. E MA ). 

From the condition that \i\ is a probability measure, one deduces that we must have 



^2is(a)/j, (a) = 0. 



aeS 

We define the sequence M = and, for n > 1, 

n 

M B = J>(A,-)- 
i=i 

Note that the random process (M n )„ gN is a centered martingale under P°. 

Let d G V(T) and assume assumption (BA) holds. We shall see in Proposition 3.2 that the 
sequence (\Z n \,M n ) satisfies a joint central limit theorem and that the asymptotic covariance 
of \Z n \ and M n is given by 

a(u,fi -d) := lim -E°[|Z n |M n ] . (2.5) 

n— >oo fi 

Theorem 2.1 Let d e T>(T) satisfy (BA). Then the function A — > l(fi\;d) is differentiate 
and its derivative satisfies 



d_ 

dX 



e(fi\;d) = cr(i/,fi ;d) . (2.6) 

A=0 

Theorem 2.2 The function A — > h(/i\) has a derivative at A = which satisfies 

h(nx) = o-(v, no', d o) > ( 2 - 7 ) 



d 

dX 



A=0 

where := d^ is the Green metric associated to the probability 

In [5] , we showed that the asymptotic entropy coincides with the rate of escape in the Green 
metric: 

%) = e(jM; d£) 

for all pi G V S (S). Thus, using the result in Theorem 2.1, we may reformulate (2.7) as follows: 
lim \(h(vx) - hfa)) = lim \(£(^; d%) - efa x] <f G )) . 

A^O A A^O A 

In other words, as far as first order terms are concerned, the fluctuations of the Green metric do 
not contribute to the fluctuations of the entropy. This turns out to be a quite general statement 
for random walks on non-amenable groups, see part 4. 
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2.3 Heuristics 



We give a simple - but not completely rigorous - way to guess why formula (2.6) should 
hold true. We provide these heuristics in order to clarify the scheme of the proofs, with the 
hope that this scheme can be adapted to other examples of random walks. 

To compute the rate of escape, observe that 

E x [\Z n \]=E°[\Z n \U] =l ^^}. (2.8) 
Taking the derivative in (2.8), we get that 

n 

E x [\Z n \] =E°[\Z n \Y t v(X j )] =E°[\Z n \M n ]. (2.9) 

A=0 j=1 

Thus we see that a reasonable candidate to be the derivative of £([i\; d) is the limit of 
±E°[|Z n |M n ] as n tends to +oo. 

Observe however that in order to turn this loose argument into a proof, one needs justify 
how to exchange the order between the limit in n and the derivation in A. 

On the one hand, we shall rely on a quantitative version of the law of large numbers for 
\Z n \ to show that the derivative of £(/i\;d) is well approximated by the limit of the ratio 
(E A [|Z n |] — E°[|Z n |])/An as soon as A tend to 0+ and n tend to +oo in such a way that 
the product An tends to +oo, see Lemma 3.1. This follows from the fact that the function 
n — > E x [\Z n \] is almost additive, uniformly in A. 

Thus it is sufficient to describe the limit of E A [|Z n |]/An when An — > +oo. We will actually 
choose A and n such that \^/n tends to 1. Then E A [|Z n |]/An ~ E A [|Z n |]/y / n, which is the 
scaling of the Central Limit Theorem. 

More precisely, we show a joint C.L.T. for the random vector (\Z n \,M n ) under P°, see 
Proposition 3.2. Let o-(v, fi ; d) denote the asymptotic covariance of \Z n \ and M n . 

Consider the Girsanov weight H™ =1 H\{Xj)/ /i (Xj) in formula (2.8). Up to error terms 
of smaller order, it coincides with the exponential (in the sense of martingale theory) of the 
martingale (AM n ) neN . With our choice of the scaling between A and n, the asymptotic of \M n ~ 
M n / \fn is given by the Central Limit Theorem. Therefore the limit of the Girsanov weight is 
of the form e M ~^ M ^ for some Gaussian random variable M. Moreover, provided we can check 
some integrability conditions, the joint C.L.T. implies that the limit of E A [|Z n |] — n£(fi Q ;d)]/\n 
is then of the form E[Ze M_ 2 E [ M2 ]] ; where (Z, M) is a Gaussian vector with covariance E[ZM] = 
(j(z/, /i ; d). The integration by parts formula for Gaussian laws implies that (for any Gaussian 
vector) 

E[Ze M ~^ E[M2] ] = E\ZM] . 

The next Theorem summarizes the part of this argument we just sketched that does not 
explicitly use the hyperbolicity of Y. 

Theorem 2.3 Let Y be a finitely generated group; S a finite symmetric generating set; d a 
left-invariant proper metric on Y and A G [—1, 1] — >■ /i\ G V S (S) be a curve in V S (S) satisfying 
the Regularity Assumption. We further assume that: 
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(i) the joint Central Limit Theorem holds for the vector (\Z n \,M n ) under P° with asymptotic 

covariance a, 

(n) 

sup -E°[(|Z n | - n£(fi ; d)) 2 } < +00 . 

n n 

Then we have 

M ^ X [\Z n \]-^[\Z n \]) = a, 

n-t+oo,\-tO AH 

along any sequence A such that lim sup n ^ +00 \ 2 n < +00. 
Theorem 2.3 is proved in section 3.3. 

3 Proofs of Theorems 2.1 and 2.3 

The proofs are organized in the following way: in part 3.1, we recall some estimates on the 
mean distance E M [|Z n |] from [6] and use them to show that Theorem 2.1 can be deduced from 
Theorem 2.3. In part 3.2, we recall results from [4] and show how they imply a slightly stronger 
version of the assumptions of Theorem 2.3. Parts 3.1 and 3.2 use the hyperbolicity of T is an 
essential way. 

In part 3.3 we prove Theorem 2.3. Part 3.3 can be read independently of the preceding 
ones. 

Let d G T>(T) satisfy assumption (BA). We will show that lim A ^ iW/-^; d) — £(fio]d)) 
exists and equals <x(z/, fi ; d). In the proof it will be convenient to restrict ourselves to positive 
A's. This is no loss of generality since a(u,fi ; d) is linear in v. 

3.1 Geometric input 



Lemma 3.1 Let A tend to 0+ and n tend to +oo in such a way that the product An tends to 
+oo. Then 

l(/i A ^)-lQ»o;d) E A [|Z ra |]-E°[|Z»|] 
A An 

tends to 0. 

Proof: Let n G V S (S). 

The triangle inequality implies that the sequence a(n) := E M [|Z„|] is sub-additive. Therefore 
we have £(fi; d) = inf n and thus 

W[\Z n \]>ne(fi;<t) (3.10) 

for all n. 

We need a similar upper bound. It will follow from bounds on the lateral deviation of a 
trajectory of a random walk. Let us recall some results from [6]. 
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In [6], Proposition 3.8, we showed that, for any fi G V S (S), there exists a constant To such 
that for all integers m, n, k, 

W[(Z m , Z m+n+k ) Zm+n ] < r . (3-11) 

Applying this inequality with m = and using the fact that E M [|Z n+fc — Z n \] = W[\Z k \], we 
get that 

W[\Z n \] + E"[|Z fc |] < 2r + W[\Z n+k \] . (3.12) 

Thus the sequence b(n) := 2r — E M [|Z„|] is also sub-additive. Note that b(n)/n converges to 
d). As above this implies that 

E"[|Z n |] <n£(ii;d) + 2T (3.13) 

for all n. 

Combining (3.10) and (3.13), we see that we have proved that 

m\Z n \]-n£( f i;d)\<2r (3.14) 

for all n. 

A close inspection of the proof of (3.11), reveals that the constant To is locally uniform in 
V S (S) so that we may apply (3.14) with the same constant To to all measures fx\ for A in a 
small enough neighborhood of 0. The statement of Lemma 3.1 immediately follows. ■ 

Let us choose A tending to 0+ and n tending to +oo such that \ 2 n tends to I. Thus Lemma 

3.1 applies. In order to complete the proof of Theorem 2.1, it only remains to show that 

lim — [Ulll. = a {y^ ^ d) . (3.15) 

\,n ATI 

3.2 Central Limit Theorems 



In this part of the paper we recall some results from [4] on the Central Limit Theorem for 
\Z n \ and discuss their extension to a joint C.L.T. for (\Z n \, M n ). 

Let d G T>(T) satisfy assumption (BA). Let ji G V S (S) be a finitely supported symmetric 
probability measure on T. 

Mimicking the situation of the discussion preceding Theorem 2.1, we also let v be a real 
valued function defined on S and satisfying the centering condition: ^ aeS v{a)^i{a) = and 
consider the sequence of random variables M = and, for n > 1, 

n 

M n = J>(^). 

j=i 

Proposition 3.2 . 

(i) The law of the two-dimensional random vector {{\Z n \ — rU(fX]d))/y/n,M n /y/n) under 
weakly converges as n tends to +oo to a centered Gaussian law with some covariance matrix 
W. 

(ii) The covariance matrix of {{\Z n \ — n£(/i; d))/y/n, M n /y/n) under P M converges to S M . 

In particular, the sequence ^E M [|Z n |M„] converges as n tends to +oo and its limit is the non- 
diagonal term o/E^. 
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Proof: 



We recall the following classical version of the martingale central limit theorem: (see [13]). 

Lemma 3.3 Let (Cn)neN be a square integrable, centered martingale with respect to a filtration 
(^n)neN ; with stationary increments . Assume that 



almost surely, where a 2 is a deterministic real. Then the law of (;^=Cn) weakly converges to a 
centered Gaussian law with variance a 2 . 

Proof of Proposition 3. 2: 



Following [4], we first prove a version of Proposition 3.2 where \Z n \ —n£(fi; d) is replaced by 
an appropriate martingale approximation that we denote with (xn)- 

Let dT the Busemann boundary of T. We recall that the Gromov product extends to the 
boundary. 

Under P M , almost any trajectory (Z n ) n< z N converges to a limiting point in <9r, say Z^. This 
follows from assumption (BA) since then dT can be identified with the Gromov boundary of V 
and one knows that random walk paths almost surely converge in the Gromov compactification 
of a hyperbolic group, see [l]or [14]. The law of is called the 'harmonic measure'. We 
denote it with £ M . 

In [4] part 4, it is proved that there exists a bounded function u on <9r such that the sequence 



is a martingale under W for any h G dT. 

In the sequel, we shall assume that h is chosen according to the harmonic measure £ M and 
independent of the walk (Z n ) n& ?$. It then follows that (Xn)neN has stationary increments. It is 
also proved in Theorem 9 in [4] that the Lindeberg condition (3.16) is satisfied. 

Remark 3.4 The group V has a natural action on its boundary. For each k G dT, the sequence 
(Z~ 1 /c) nG N is a Markov chain with values in dT started at k. This Markov chain has a unique 
invariant probability measure, namely the harmonic measure Moreover, the Markov chain 
(Z~ l k) n& }i has nice mixing properties: see Lemma 4 in [4]- By choosing h according to £ M 7 we 
are actually considering the chain in a stationary regime. 

We shall not explicitely use these remarks but they lie at the heart of the proof of the central 
limit theorem in [4]. 

On the other hand, the sequence (M n ) ra6 N being a sum of independent bounded and centered 
random variables, is also a centered martingale with stationary increments under P' 1 satisfying 
condition (3.16). 

Thus we deduce from Lemma 3.3 that the law of the vector (Xn/y/n, M n / y/n) under W 
converges to a centered Gaussian vector. Indeed, one may apply Lemma 3.3 to the martingale 
(axn + bM n ) neN for any a,b G R. 




(3.16) 



Step 1: 



h(Z n ) - n£(fi; d) + u(h) - u{Z~ x K) 
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Let S M be the limit covariance matrix. For (a, b) G M 2 , we use the notation £ M (a, b) to 
denote the value of the quadratic form associated to S M evaluated at (a, b). We observe that 
since both martingales (xn) an d (M n ) have stationary increments, then E M is also the covariance 
(under W) of the vector (Xn/Vn, M n /y/n) for any n > 1, that is 

-E^[(a X „ + &Mn) 2 ] = (3.17) 

Step 2: 

In the above claims, we wish to replace Xn by \Z n \ — n£(/j,; d). We shall use the following 
Lemma 3.5 There exists a constant C such that, for all D we have 

^[(h, Z n ) td >D]< C- x e~ CD . (3.18) 
Inequality (3.18) holds uniformly in h. 
Proof of Lemma 3. 5: 

The statement of the Lemma actually directly follows from arguments in [6] . 
One may for instance split the event (h,Z n ) id > D into two, say A := ((h,Z n ) id > D) D 
((Z^, Z n ) ld > f ) and B := ({h, Z n ) id > D) n {(Z^, Z n ) ld < f ). 
Let us show that W[A] + W[B] < C' x e' CD . 

In the argument below T\ is a constant that depends on d and fi only. We choose D large 
enough and how large depends also only on the choice of the metric d and the measure ji only. 
In particular, we assume that D > 4r, where r is the hyperbolicity constant from (2.2). 

Hyperbolicity implies that, on A, we also have {h.Z^^ > y — T\ > y. We know from 
[6] Proposition 3.10 that £ M satisfies the doubling condition. Therefore the probability that 
(hjZ^id > y) can be compared to the harmonic measure of a ball of dT of radius of order 
e -CiD f or some some d, and since ^ is Ahlfors regular, see Theorem 1.1 in [6], we get that 
P"L4] < C- l e- CD . 

On the event B, we have (Z^, Z n ) id < y and \Z n \ > D — r > ^D. Therefore the distance 
between Z n and any quasiruler from id to Z^ is larger than ^ — T\. For large enough D, this 
last event has a probability bounded from above by C~ 1 e~ CD for some C as follows from the 
deviation inequality in Proposition 3.8 in [6]. Therefore W[B] < C _1 e _c * D . ■ 

Back to the proof of Proposition 3.2, we observe that h(x) = \x\ — 2(h,x) id for all h e dT 
and x G T. Therefore 

\Z n \ - n£(/i] d)-Xn = 2(/i, Z n ) id - (u(h) - u(Z~ l h)) . 

Using Lemma 3.5 and the fact that u is bounded, we get that 

W[\\Z n \-nt{n-d)- X n\ >D]< C-\- CD (3.19) 

for some constant C . 

As a by-product of (3.19), we get that -^{\Z n \ —n£(fi; d) — Xn) converges to in probability. 
Therefore the two sequences of vectors {{\Z n \ — n£(fi; d))/y/n, M n / y/n) and (Xn/ y/n, M n / y/n) 
have the same limit in law. Thus we have proved that {{\Z n \ — n£{\i\ d))/y/n, M n /y/n) satisfies 
the central limit theorem with asymptotic variance S M . 
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We also deduce from (3.19) that 

sup E"[(|Z n | - n£(fx; d) - X nf] < oo , (3.20) 

n 

and 

^[(\Z n \-n£(fx;d)- X n) 2 ]^0. 

Combining (3.20) with (3.17), we see that the covariance matrix of ((\Z n \—n£(fi; d))/*Jn, M n /y/n) 
under converges to S M . Also 

-W[\Z n \M n ] = -W[{\Z n \-n£{^d))M n ] 
n n 

converges as n tends to +oo and its limit is the non-diagonal term of S M . This concludes the 
proof of the Proposition. 

■ 

For further references, we observe that (3.20) with (3.17) implies the following 
Lemma 3.6 For all /j, G V S (S), we have 

sup -E^[(|Z n | - n£(n; d)) 2 } < oo . 



n 



n 



3.3 Proof of Theorem 2.3 and (3.15) 

Here T is any finitely generated group. 

We recall that we are assuming the joint Central Limit Theorem for the vector (\Z n \,M n ) 
under P°; namely the law of the two-dimensional random vector ((|Z n | —n£(fj,; d))/ ^Jn^ M n j ^Jn) 
under P° weakly converges as n tends to +oo to a centered Gaussian law with some covariance 
matrix S. Let a be the non-diagonal element of E. 

We also assume that 

sup -E°[(|Z n | - n£(no] d)) 2 } < +oo . (3.21) 

n 71 

We wish to prove that 

]im J-(E x [\Z n \]-E°l\Z n \]) = o-, (3.22) 

n^+oo,A-5>0 ATI 

along any sequence A such that lim sup n _>. +00 \ 2 n < +oo. Without loss of generality, we may 
and will assume that \ 2 n converges to some limit a > 0. Then A ~ \Ja/n. 

We start dealing with the case a^O. 

First note that 

E A [|Z Tt |] - E°Qjy _ E A [|Z n | - n£(fi ; d)] - E°[\Z n \ - n£(fi ; d)] 
\n An 
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From the Central Limit Theorem for \Z n \ under E° and assumption (3.21), we get that 
E°[\Z n \ - n£(pL ; d)} E°[\Z n \ - n£(fi ; d)] 



An y/ay/n 



-> 0. 



In order to compute the limit of E A [|Z n | — n£(/io; d)]/Xn, we write these terms in a form 
that is more amenable to the application of the Central Limit Theorem. 
We have 

^f X [\Z n \ - nib*; d)} = ^f[{\Z n \ - rd^ d))U] =1 ^l] . (3.23) 

Let a G S. Recall that we have a first order expansion of the function A — > /i\(a) in the 
form 

log^4 = Az/(a) + Ao A (a), (3.24) 

ma) 

where the function 0\ uniformly converges to as A goes to 0. 

Because ua is a probability for all A, it follows from (3.24) that v and o\ must satisfy the 
following centering conditions: 



^2v(a)fio(a) = (3.25) 

and lim ^ V(o A (a) + -z/ 2 (a))u (a) = . (3.26) 

With this notation, we re-write (3.23) as 

X -E\\Z n \ -n£(md)} = -^E°[(\Z n \-n£^ 0] d))e XM -' x2A ^}, (3.27) 



An An 



where 



n 1 ™ n \ 

$>P0) , A n :=^AXj) and i£ = A + -z, 2 (X,) 



:- , , , ., 

j=l j=l j=l 



From the Law of Large Numbers for sums of i.i.d. random variables, it follows that \A n al- 
most surely converges to | J2 aeS ^ 2 (o)/^o( a ) an d therefore A 2 A n converges to | J2 aeS ^ 2 (a)u (a). 
We claim that _R A converges to in probability under P°. 

The argument for this last claim goes as follows. Let Y x := jO\(Xj) + |z/ 2 (X,), so that 
i? A = A 2 Y^=i For a fixed n, the random variables (1^ A )™ =1 are independent and equally 
distributed. On the one hand (3.26) implies that E^Y^] tends to and therefore E°[.R A ] also 
converges to 0. On the other hand, the variance of Y x satisfies lim A ^. A 2 V [Y 1 A ] = 0. (Use the 
fact that o\ converges to 0.) Therefore the variance of i? A is of lower order than A 2 n and tends 
to 0. Thus we get that both the mean and variance of i? A converge to 0. 

We now use the Central Limit Theorem for ((\Z n \ —n£(fM)',d))/y/n,M n /y/n). 
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Ignoring for a moment that the function we integrate in (3.27) is not bounded, we pass to 
the limit using the relation \ 2 n — > a and get that 

-^-E x [\Z n \ -nlU Q -d)\ ->■ -^-E[Ze^ M -^^s- 2 (a)M^ ? ( 3 . 2 8) 
An Ja 



where (Z, M) is a centered Gaussian vector with variance E. 

Observe that the variance of M n equals n^2 aeS z/2 ( a )/ x o(o) and therefore J2 a &s z/2 ( a ) u o(o) 
E[M 2 ]. Thus the right hand side of (3.28) equals 



— E[Ze^ M -^ E[M ' 2] ] . 



The integration by parts formula for Gaussian laws implies that (for any Gaussian vector and 
any a) 

J-E[Ze^ M ^ E ^] = E[ZM] = a. 

Thus we are done with the proof of Theorem 2.1 once we justify that we may indeed pass 
to the limit in (3.27). In order to do so, it is sufficient to have bounds on the moments of the 
functions we integrate. 

Holder's inequality implies that 

E°[(^(|Z B | - n£(u ; d )) e ™»-A 2 ^+**)«/5] 

< E°[(-^(|Z„| - n£(/i ; rf ))2]6/10 E [e 3AM„-3A^ n+ 3^]6/15 
/Kit 

We already know from assumption (3.21) that p^p-E°[(|Z' rj ,| — n£(fi ; d)) 2 ] is bounded in n. 

Let us prove that E°[e 3AM »- 3A2i " +3 ^] is also bounded in n. 
Note that there exists a constant C such that 

jg0^ e 3AM n -3A 2 A„+3ii£j < jgO | e 3AM„+3i?£ j 
= e 3E°[^]] E 0[ e 3AM n +3(^-E ^])j < gCjgO ^3AAf„+3(fl*-E°[fl*])] _ 

(We used the fact that E°[.R A ] is bounded for the last inequality.) 
From the independence of the X/s follows that 

E 0[ e 3AM„+3(fli-E [fl*])] = E 0[ e 3Ai/(^i)+3A a (y 1 A -E°p^])]n _ 

But the random variables v{Xi) + A(F 1 A — E°[Y" 1 A ]) are centered and bounded (uniformly in 
A) i.e. there exists a number M such that \v{X\) + \(Y? - E ^])] < M for all A and all 
trajectory u. Therefore E [e 3Xu( - x ^ +3X( - Y ^~ EO ^ Y ^] n is bounded whenever A 2 n is also bounded. 
For this last step, we rely on the following classical Lemma, see part 7 of [8] for instance. 

Lemma 3.7 For all M and K there exist constants Cm and hq s.t. for all random variable X 
with \X\ < M and E[X] = and for all A and n > n s.t. \ 2 n < K then 

E[e xx ] n < e° M . 
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This completes the proof of (3.22) in the case «^0. 

The case a = is easier. As we did in (3.23) and (3.27), we start writing that 

i-(E A [|Z n |] - E°[|Z B |]) = ^f°K\Zn\ - n^ ; d))(e XM ^ x2A - +R - - 1)] . 
Using similar arguments as for the case a ^ 0, it is not difficult to show that 

lim ^-E°[(\Z n \ - netw, d))( e XM "~ x2An+R - - 1)] 
lim ^-E°[(|Z B | - rUto, d))\M n e- x2A " +R -] . 

rw+oo,A^0,A 2 ?w0 ATI 

Observe that 

^E°[(\Z n \ - niifjio; d)) \M n e~ x2 An+R ™] = -E°[(\Z n \ - n£(^ o] d))M n e- x ' 2A - +R -] . 

The limit of this last expression is given by the Central Limit Theorem and, with the notation 
we already used, it coincides with E[ZM] = a. Observe that, with our scaling satisfying 
A 2 n — > 0, we have \ 2 A n — > 0. The details are similar to the case a = 0. 

■ 

End of the proof of Theorem 2.1 

Proposition 3.2 and Lemma 3.6 from part 3.2 show that the assumptions of Theorem 2.3 
are satisfied. Thus we get that 

lim ^-m\Z n \] - E°[\Z n \\) = a(u, w d) . 

»w+oo,A— >0,A 2 ri— >1 ATI 

But we observed in part 3.1 that this convergence implies Theorem 2.1. ■ 



4 Proof of Theorem 2.2 

We now explain how to deduce Theorem 2.2 from Theorem 2.1. 

As in the proof of Theorem 2.1, we may and will restrict ourselves to positive A's. 

We first recall that the entropy can be interpreted as a rate of escape in the Green metric: 
h(fi) = £(fi; d G ). Thus we have: 

h(jM X ) - %o) = {h(jM X ) - £(fi x ; d° G )) + (1(jm x] d° G ) - % ; <f G )) . (4.29) 

By Theorem 2.1 and since d G satisfies (BA), once divided by A, the second term in (4.29) 
converges to a(u,/j, ; d G ). Thus the proof of Theorem 2.2 will be complete once we prove that 



lim \{h^ x )-£^ x] d G )) =0. (4.30) 

A^0+ A 

It turns out the estimate (4.30) does not use the hyperbolicity of T. We have the following 
more general property: 
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Proposition 4.1 Let T be a finitely generated group. Assume T is non-amenable. Let /i be 
a probability measure on T such that the support of /i generates T (as a semi-group), and 
H{n ) < oo. 

Consider a curve of probability measures on T, say A G [—1, 1] — > satisfying the Regu- 
larity Assumption: 

log fi X (a) = log no(a) + \u(a)/j, (a) + Ao A (a) , 

where v is bounded and o A (a) converges to uniformly in a G T. 
Then 

liml(%,)-%4))=0. 

Remark 4.2 We do not assume any more that fi or the fix 's are symmetric. Then the Green 
metric may not be a real distance. Indeed, although it still satisfies the triangle inequality, it 
may not be symmetric. 

Thus in this part of the paper, the word 'metric' will refer to a function on T x T that 
vanishes on the diagonal and satisfies the triangle inequality. 

The interpretation of the asymptotic entropy as the rate of escape in the Green metric 
remains valid in this general framework, see [5]. 



Proof: We give two separate arguments for lower and upper bounds for h{ji\) — £(fi\] d° G ). 
Both arguments are based on the so-called 'fundamental inequality' that we first recall: let fi 
be a probability measure on T with finite entropy and let d a left-invariant proper metric on 
T. We denote with v(d) the logarithmic volume growth of the metric d. The 'fundamental 
inequality' states that h(fi) < v(d)£(fi; d), see [11], [20], [5] and the references quoted therein. 

The 'fundamental inequality' in particular applies to any Green metrics <ig.. By a result in 
[5], we have v(<ig) = 1. Therefore we get that 

M/i A )=£(/i A ;4)<£(/i A ;^), (4.31) 

for all A and a. 

Applying (4.31) with a = 0, yields h(fi\) < d%) and therefore 

lim sup \ (h(pL X ) - £(fi x ; d° G )) < (4.32) 

It remains to prove the lower bound. We first need review properties of the Green metric. 
Consider a probability measure \x G V S (S). We recall that we defined the Green metric as 

d G (x,y) := logG"(id) - logG^x-'y) , 

where G^(x) = J2™ =0 H n (x) is the Green function of the random walk. 

We may equivalently express d G in terms of the hitting probabilities of the random walk: 
for a given trajectory oj G Vt and z G T, let 

T z (cu) = inf{n > 0; Z n (w) = z) 
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be the hitting time of z by u. Observe that T z (u) may be infinite. 
Define F»(z) := W[T Z < oo]. Then 

d G (td,z) = -\ogF»(z), 

as can be easily checked using the Markov property. 
In the sequel, we use the notation F x instead of . 

Lemma 4.3 The function (A, a) — > d G ) is bounded on [0, l] 2 : 

sup ^(/Ua; d G ) < oo . 

0<A<l;0<a<l 

Proof of Lemma 4 ■ 3 

Let /i and fi' be probability measures on T and let d be a proper left-invariant metric on T. 
It is clear that 

£(fx; d) < d(id, a)p,{a) . 
aer 

Also F tx '{a) > /i'(a), and therefore d G (id, a) < — log /i'(a). 
Applying these two inequalities to p, = p\ and p' = p a , we get that 

^\\d G ) < -^(log /i a (a))nx(a) 
aer 

= - ^(log Po(a) + av(a) + ao a (a))p\(a) . 

Since v and o A are bounded, we have p\{a) < e c p (a) for some constant C. For the same 
reason, the term XLer( az/ ( a ) + ao a( a ))/^A(fl) is also controlled by a constant. Therefore 

£(px;d G )<e c H(p ) + C, 

for some constant C. 

■ 

We shall need the following estimate on T z : 

Lemma 4.4 Let p e V S (S). Then there exists a positive constant k such that 

supE^[e KTz ; T z < oo] < oo . 
zer 

Proof of Lemma 4-4 

We use the non-amenability of T: there exists a constant p < 1 such that, for all n and all 
z G T, we have p n (z) < p n . Therefore 

W[e KT * ;T z <oo] = ^e Kn P"[T, = n\ 

n 

< W \ Z n = Z )=Y1 e "V(*) 

n n 
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< j2 eKn p n< °° 

n 

as soon as e K p < 1. ■ 
End of the proof of Proposition the lower bound. 
We use the shorthand notation F x (z) := F^ x (z). 
As in (3.23), we have 

F\ Z )=E°[Uj Ll ^l-,T z <oo]. 

Let us apply Holder's inequality with positive parameters (p,q,r) such that 1/p+l/q + l/r = 1 
and with the notation a = Xq. We assume that a < 1. Thus 

F\z) < F°(z)^F a (z)^E [( n^^^M^^V ; T, < oo] 1 ^. (4.33) 
Let a G S. Using the Regularity Assumption and the relation a = Xq, we get 

^a(o) / M a ) y/q = e X(o x (a)-o a (a)) _ 

Ho{a) n a (a) 

Therefore, since X < a < 1, and remembering that o x uniformly converges to 0, we see that for 
all e > 0, provided a is small enough then 

^M ( /^)i/ 9 < exp(eA) . (4.34) 
li {a) fi a {a) 

Using (4.34) in equation (4.33), we see that 

F x (z) < F°(z) 1 / p F a (z) 1 /' 1 E [e M ; T z < oo] . 

If we further assume that eXr < k°, where k° is the constant given by Lemma 4.4 when choosing 
\i = /xo, then we have 

F x (z) < CF°(z) 1/p F a (z) 1/q , (4.35) 

for a new constant C that does not depend on z. 

We evaluate inequality (4.35) at z — Z n ; take the logarithm and take the expectation with 
respect to P A to obtain 

E x [d x (id, Z n )\ > ±E A [d° (id, Z n )\ + -E x [d a G {id, Z n )\ - log C . 

Now divide by n and let n tend to oo, to get that 

h(fi x ) = £(» x ; d x ) > -£(/i A ; d° ) + -£(fi x ; d G ) . (4.36) 
p q 

We choose r = kP/(eX) and A small enough so that 1/r + \jq < 1. Then (4.36) becomes 

h(jM X ) - itjJK d° G ) > - (tfa d G ) - t(jM X] d° G )) - ^(/i A ; d° ) . (4.37) 
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We let A tend to in (4.37): by Lemma 4.3, we know that 

A(^ A ;<£)-^ A ;4))->0 

and \£(/j, x ;d G ) ->■ 0. Therefore 

lim inf (h(fi x ) - £(fi X ; d° G )) > . (4.38) 

Using the inequality £(fj,\;d G ) > h{ji\) (which comes from the 'fundamental inequality'), 
we deduce from (4.37) that 

\(h^ x )-l^ x -d G )) > -(% A )-%;4)) _ -i^ A ; <J° ) . (4.39) 

A OL K 

It follows from Lemma 4.3 that there exists a constant £q such that -^£{^\\ d G ) < £o for all A. 
By (4.38), the term h(/i\) — £(/i\] d G ) has a non-negative lim inf. Thus we deduce from (4.39) 
that 

lim inf \ (/i(/i A ) - £(/i A ; d° )) > -e£ . (4.40) 
And since (4.40) holds for any small enough e, we have 

lim inf | (h(n x ) - £(fix; d° G ))>0. (4.41) 
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